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Protein glycosylation with O-linked A'-acetylglucosamine 
(O-GlcNAc) is a post-translational modification of serine/ 
threonine residues in nucleocytoplasmic proteins. O-GlcNAc 
has been shown to play a role in many different cellular 
processes and O-GlcNAcylation is often found at sites that 
are also known to be phosphorylated. Unlike phosphoryl- 
ation, O-GlcNAc levels are regulated by only two enzymes, 
O-GlcNAc transferase (OGT) and O-GlcNAc hydrolase 
(O-GlcNAcase or OGA). So far, no obvious consensus 
sequence has been found for sites of O-GIcNAcylation. 
Additionally, O-GlcNAcase recognizes and cleaves all 
O-GlcNAcylated proteins, independent of their sequence. 
In this work, we generate and analyze five models of 
O-GlcNAcylated peptides in complex with a bacterial OGA. 
Each of the five glycopeptides bind to OGA in a similar 
fashion, with OGA-peptide interactions primarily, but not ex- 
clusively, involving the peptide backbone atoms, thus explain- 
ing the lack of sensitivity to peptide sequence. Nonetheless, 
differences in peptide sequences, particularly at the -1 to -4 
positions, lead to variations in predicted affinity, consistent 
with observed experimental variations in enzyme kinetics. 
The potential exists, therefore, to employ the present analysis 
to guide the development glycopeptide-specific inhibitors, or 
conversely, the conversion of OGA into a reagent that could 
target specific O-GlcNAcylated peptide sequences. 
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Introduction 

Since its discovery (Torres and Hart 1984), the post-translational 
modification of proteins by P-O-Unked A'-acetylglucosamine 
(0-GlcNAc) on serine or threonine residues has generated signifi- 
cant interest in the field of glycobiology. The modification occurs 
on nuclear and cytosolic proteins, many of which modulate cell sig- 
naling (Hanover et al. 2010; Slawson et al. 2010). It is also thought 
that deregulation of this modification may be involved in neurode- 
generative diseases, cancer and diabetes mellitus (Hanover 2001; 
Hart et al. 2007). Additionally, 0-GlcNAc rapidly cycles on and off 
proteins on a time scale similar to that of phosphorylation/depho- 
sphorylation, and thus an interplay has been shown to exist between 
protein phosphorylation and 0-GlcNAcylation (Cheng et al. 2000; 
Comer and Hart 2000; Kamemura et al. 2002; Zeidan and Hart 
2010). Unlike phosphorylation, which is controlled by numerous 
kinases and phosphatases, the addition and removal of 0-GlcNAc 
is controlled by only two enzymes. 0-GlcNAc transferase (OGT) is 
responsible for the transfer of GlcNAc Irom UDP-GlcNAc to the 
protein, whereas 0-GlcNAcase (OGA) removes it. Despite the im- 
portance of this modification, very little is known about the struc- 
tural relationship between these enzymes and their substrates. Nor 
is it clear, how only a single hydrolase is capable of removing 
O-GlcNAc fix)m all proteins, independent of the protein sequence. 

To gain an understanding of the molecular interactions re- 
sponsible for the ability of OGA to tolerate multiple substrate 
sequences, we modeled the structure of bacterial OGA, 
Bactei-oide,s thetaiotaomicron GH84 (5 /OGA) in complex with 
five 0-GlcNAcylated peptides of biological interest. Notably, the 
OGAs have been grouped together into the same glycoside 
hydrolase family GH84 (Henrissat and Davies 1997) based on 
the similarity of their N-terminal sequence domains. Specifically, 
fizOGA and human OGA (hOGA) share an overall sequence 
identity of >40% (Dennis et al. 2006) with the same overall 3D 
staicture, as such the details found from the BlOGA model can 
be applied to hOGA for the development of therapeutic reagents. 

Five glycopeptides, 10 residues in length, were built from 
each of the following 0-GlcNAcylated systems: the tumor suppres- 
sor protein p53 (0-GlcNAcylated at Serl49 (Yang et al. 2006)), the 
DNA packaging protein Histone H3 (0-GlcNAcylated SerlO 
(Zhang et al. 2011)), a transforming growth factor binding partner 
T4B1 (0-GlcNAcylated at Ser395 (Pathak et al. 2012)), as well as 
c-Myc, a regulator of gene transcription (0-GlcNAcylated at Thr58 
(Chou et al. 1995)). Additionally, a synthetic 0-GlcNAcylated 
polyalanine (poly- Ala') peptide was included as a reference. 
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It has been shown that OGA uses a substrate-assisted catalytic 
mechanism, in which the A'-acetyl group of 0-GlcNAc adopts a 
j^M-confomiation, in which the amide proton eclipses the proton 
at C2, which aids the sugar to fonn the oxune transition state 
(Dennis et al. 2006; Macauley et al. 2005). Several inhibitors of 
OGA have been reported (Kim et al. 2006; Whitworth et al. 2006; 
He et al. 2009; Macauley and Vocadlo 2010), and have been 
co-crystalized with the enzyme. For this study, a model for the 
conformation of 0-GlcNAc in the binding site of fi/OGAwas gen- 
erated based on the atomic coordinates of the inhibitor PUGNAc, 
a transition state mimic and 0-GlcNAc analog (Perreira et al. 
2006). The peptides were then conjugated to the bound monosac- 
charide through a combination of molecular docking and molecu- 
lar dynamics (MD) simulation. 

To provide a structural and dynamic interpretation of the 
observed ability of OGA to deglycosylate a variety of glycopep- 
tides, each of the complexes was subjected to fully solvated MD 
simulations (350 ns), followed by direct AG energy calculations. 
We have focused on the conformational properties of these com- 
plexes, rather than on the mechanism of deglycosylation, as we 
are particularly interested in the ability of the enzyme to recognize 
0-GlcNAc in a variety of protein contexts. We compared the con- 
formation of the bound 0-GlcNAcylated peptides to a recently 
solved bacterial OGA homolog from Clostridium perfringens 
(CpOGA) in complex with an eight residue 0-GlcNAcylated p53 
peptide (PDBid 2YDR) (Schunpl et al. 2012). In addition, p53 
(PDBid ITUP) (Cho et al. 1994) is the only structurally character- 
ized protein where the 0-GlcNAcylation site is positioned in a 
flilly defined loop, thus allowing us to flirther compare our results 
(Schimpl et al. 2012). 



Results 

Structural stability of the JitOGA—glycopeptide complexes 
Over the course of the 350 ns MD simulations, performed with 
no restraints on either the protein or glycopeptides, the protein 
fold remained stable; the Ca root-mean-square deviation 
(RMSD) of the BtOGA in all cases reached an equilibrium 
value of between 0.84 and 0.93 A (c-Myc': 0.93 A, histone 
H3': 0.84 A, p53': 0.88 A, poly-Ala'; 0.91 A and TABl'; 0.84 
A). The glycopeptides in all cases remained bound to the 
protein, with the GlcNAc adopting equivalent orientations. The 
ring conformation in all complexes with fi/OGAwas monitored 
over the course of the MD simulations. The analysis showed 
that in GlcNAc-O-methyl (OMe), GlcNAc-p53, GlcNAc-Histone 
H3 and GlcNAc-TAB 1 the ring adopts conformations between 
'S3 and ''"^B. In GlcNAc-c-Myc, the sugar ring adopts an 
conformation, while in GlcNAc-poly-Ala' the ring is more flex- 
ible, adopting two conformations H3 (45%) and 'S3 to ''^B 
(55%). In the QjOGA-glycopeptide complexes (Schimpl et al. 
2012), the ring adopts conformations between 'S3 and ''''B. 
The Cremer-Pople pucker parameters (Cremer and Pople 1975) 
and cannonical ring fonns are presented in Supplementary data. 
Table SI and Figure S2. In addition, the dihedral angles of the 
GlcNAc-Ser linkages (<E>, and x) well as the NAc con- 
fonnation were monitored throughout the simulations. The ana- 
lysis showed that the GlcNAc-Ser linkages, in each of the five 
glycopeptides, populated comparable rotamers, whilst the NAc 
group remained stable in the .vy«-orientation throughout (Table I). 



Additionally, the average dihedral values for these angles in the 
simulated p53' and TAB 1 '-glycopeptide complexes were in close 
agreement with those seen in the crystal structures of CpOGA 
complexed with these ligands (Table I). 

GlcNAc: A comparison of the hydrogen bond interactions 
between the GlcNAc and OGA residues for GlcNAc-OMe as 
well as all five glycopeptides is presented in Supplementry 
data. Table SII. Hydrogen bonds were found between six OGA 
residues and the glycan. Of those, five involved sidechain interac- 
tions (D344, N339, D242, K166 and H433), whilst the remaining 
was through a backbone interaction involving OGA residue G135. 
The interactions between the GlcNAc residue and OGA, in both 
the p53' and TABl' — BtOGA glycopeptide complexes, were in 
good agreement with those seen in the recently reported crystal 
sfructures for the corresponding glycopeptides in complex with 
Q?OGA (Schimpl et al. 2012) (Supplementry data. Table SII). 
The similarity of the crystallographic and modeled complexes pro- 
vided independent validation of the computational approach. 

Poly-Ala': The hydrogen bond analysis of the BtOGA — 
poly-Ala' complex showed that the most stable hydrogen bond 
(26% occupancy) was formed between the sidechain of OGA 
residue D243 with the backbone of Ala at the +2 position (Ala 
(+2)) of the glycopeptide (positional numbering is relative to 
the site of glycosylation. Figure lA). Lower occupancy hydro- 
gen bonds were observed between D243 and Ala(+1) position, 
and OGA Y 137 and Ala(-2) (see Table II). It is notable that no 
long-lived hydrogen bonds were seen between poly-Ala' and 
OGA. 

c-Myc': In the case of the c-Myc' glycopeptide, the most 
stable intermolecular hydrogen bond (31.5% occupancy) was 
observed between the backbone of L56(-2) and OGA residue 
D243, but unlike the case of poly- Ala', here the interaction 
involved only the backbone of D243. Weak (16.8% occupancy) 
backbone-only interactions were also seen between E54(-4) and 
OGA residue S245. The remaining hydrogen bonds are almost 
exclusively through peptide backbone interactions and OGA; 
none of these were present for over 8.1%) of the simulation time. 

Histone H3': The strongest intermolecular hydrogen bond 
(31.1% occupancy) in the histone H3' glycopeptide-OGA complex 
was seen between the backbone of G12(+2) of the glycopeptide 
and the sidechain of OGA residue D243. This interaction and 
occupancy is analogous to that seen in the case of poly-Ala' 
complex. The second strongest hydrogen bond (25% occupancy) 



Table I. Average dihedral angles for GlcNAc-Ser/Thr linkages in each of the 
five glycopeptide-B/OGA complexes 





0' 




x' 


NAc" 


c-Myc' 


-51.3 (8) 


168.3(9) 


-57.1 (8) 


-67.2 (7) 


Histone H3' 


-78.2(10) 


-176.8(7) 


-63.7(8) 


-64.1 (9) 


p53' 


-73.3 (9) 


-174.0(8) 


-57.8(9) 


-59.2 (9) 


p53 X-ray'' 


-74.6 


173.2 


-52.4 


-52.7 


Poly-Ala' 


-74.8(13) 


176.2(18) 


-63.7(11) 


-63.3 (9) 


TABl' 


-86.9(10) 


173.7(8) 


-64.3 (9) 


-62.1 (8) 


TABl X-ray'' 


-67.8 


165.8 


-52.8 


-56.5 



= 05-C 1 -Oy-Cp, T = C 1 -Oy-CP-Ca, x = Oy-Cp-Ca-N, in degrees, with 
standard deviations in parentheses. 

''From the p53 and TAB 1 glycopeptide-CpOGA complexes (Schimpl et al. 
2012). 

"Dihedral angle C1-C2-N-C. 
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A B C 




Fig. 1. (A) Five polypeptides were built in extended conformations using the Maestro builder from Schrodinger. Sites of glycosylation are indicated with a star and 
are positioned at Thr58 (c-Myc), SerlO (Histone H3), Serl49 (p53), Ser5 (poly-Ala') and Ser395 (TABl). (B) The top docked conformation of each peptide was 
used as a starting point for MD. (C) Ribbon representation of the peptide structures from the 350 ns simulation aligned onto the GlcNAc residue from the bound p53 
glycopeptide - CpOGA crystal structure. The snapshots have been selected every 3.5 ns. The peptides are coloured as follows; c-Myc' (purple), Histone H3' (red), 
p53' (blue), poly-Ala' (green), TABl' (cyan), with the p53-crystal glycopeptide in yellow. 



was observed between the backbone atoms of R8(-2) and the 
sidechain of OGA residue R347. Additional weak hydrogen 
bonds were observed between D167, Y137, H433 and D243 of 
OGA and the peptide; stable for a maximum of 19.2% of the 
time. 

p53': Glycopeptide p53' formed a weak intermolecular hydro- 
gen bond (32% occupancy) between the sidechain of W146(-3) 
and Y550 of OGA. When compared with the recent crystallo- 
graphic data of the p53 glycopeptide-CpOGA complex (Schimpl 



et al. 2012), it was found that this hydrogen bond appears to 
replace the stacking interaction of W146(-3) with protein residue 
N298 {BtOGA N243). A comparison, of the crystallographic and 
MD-derived conformations of the p53 glycopeptides with the 
crystal structure of the full-length p53 protein (PDBid ITUP) 
(Cho et al. 1994), indicated that the W146 orientation from the 
MD results was in good agreement with the conformation seen in 
the intact p53 protein (Figure 2). The backbone conformation of 
the MD-derived p53' glycopeptide gave rise to an average RMSD 
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Table II. Intermolecular hydrogen bond occupancies between BtOGA and the 
peptide moieties for each of the five glycopeptide-OGA complexes 



Donor H-X Acceptor Occupancy 



c-Myc' 



L(-2)-Main 


D243-Main 


31.5 


S245-Main 


E(-4)-Main 


16.8 


E(-4)-Main 


S245-Main 


8.1 


H433-Side 


T*(0)-Main 


7.5 


P(+1)-Main 


D243-Side 


7.1 


S245-Side 


E(-4)-Side 


6.5 


Histone H3' 






G(+2)-Main 


D243-Side 


31.1 


R347-Side 


R(-2)-Main 


25.4 


R(-2)-Side 


D167-Side 


19.2 


Y137-Side 


S*(0)-Side 


13.5 


H433-Side 


S*(0)-Main 


10.9 


T(+1)-Main 


D243-Side 


5.4 


p53' 






W(-3)-Side 


Y550-Side 


31.9 


H433-Side 


T(+1)-Side 


18.3 


L(-4)-Main 


D243-Main 


14.8 


Y137-Side 


S*(0)-Side 


8.3 


W(-3)-Side 


K166-Main 


7.8 


V(-2)-Main 


Y137-Side 


6.7 


T(+1)-Side 


H433-Side 


5.9 


Poly-Ala' 






A(+2)-Main 


D243-Side 


26.0 


A(+1)-Main 


D243-Side 


5.5 


A(-2)-Main 


Y137-Side 


5.4 


TABl' 






A(+2)-Main 


D243-Side 


24.3 


H433-Side 


Y(-1)-Side 


7.9 


Q(+3)-Side 


W286-Side 


6.9 


Y137-Side 


S*(0)-Side 


6.7 



''Based on a maximum donor-acceptor distance of 3.5 A with a minimum 
occupancy of 5%. 



of 1 .4 A relative to the same region in the intact protein, whereas 
the data from CpOGA crystal structure resulted in an RMSD 
value of 2.2 A. Other OGA residues predicted by MD to make 
hydrogen bonds to the p53' peptide include H433, D243, Y137 
and K166; however, these were again short lived. 

TABl ': As seen in the case of poly- Ala' , in TAB 1 ' the most 
stable intermolecular hydrogen bond (24.3% occupancy) was 
again a peptide backbone interaction, specifically between 
A397(+2) and the sidechain of OGA residue D243. No other 
interactions existed for >8% of the total simulation time. 

Structure and dynamics of the bound glycopeptides 
To assess the presentation and conformation of each of the gly- 
copeptides, obtained from the MD simulations, with that seen 
in the crystal structure of the p53 glycopeptide-QjOGA 
complex (Schimpl et al. 2012), the glycan atoms in the glyco- 
peptides were aligned onto the crystallographic coordinates of 
GlcNAc in the complex (Figure IC). This alignment indicated 
that, in all cases, despite the difference in sequences, each 
peptide backbone adopted a similar "V"-shaped confomiation; 
comparable with that seen for glycopeptide fragments of p53, 
TABl and hOGA in complex with Q?OGA (Schimpl et al. 
2012). This "V"-shaped conformation is likely stabilized by a 
combination of intermolecular interactions, between the pep- 
tides and those BtOGA residues identified from both the 




Fig. 2. Ribbon representation of both, the most populated p53' glycopeptide 
(blue) and the p53 glycopeptide from the CpOGA complex 2YDR (yellow), 
were aligned onto the corresponding sequence (Leu 145 to Gly 151) in the p53 
protein crystal (PDBid ITUP) (grey). Trp 146 of p53 (-3 position in the 
glycopeptides) is shown in stick with the GlcNAc residue removed for clarity. 

hydrogen bond and the binding interaction analyses, as well as 
intramolecular bonds within the glycopeptide. In addition to 
the intennolecular interaction, discussed previously, an analysis 
of intramolecular interactions showed that all glycopeptides, 
with the exception of poly- Ala' , form, primarily, but not exclu- 
sively, mainchain hydrogen bonds between residues at the —1 
or -2 positions with opposing residues at the +1, +3 or +4 
positions in the glycopeptide (see Supplementary data. 
Table SIII). Specifically, in the complex of BtOGA with the 
p53'-glycopeptide, a hydrogen bond was observed (3.0 A, 76% 
occupancy) between the carboxylate of D(- 1 ) and the main- 
chain nitrogen atom of T(+l). A similar interaction has been 
reported in the crystal structure of the p53 glycopeptide with 
Q?OGA (2.5 A) (Schimpl et al. 2012). That this interacfion 
fonned spontaneously in the MD data, which was initiated in- 
dependent of the C/'OGA-p53 complex, adds to the credibility 
of the general approach taken here to model the fi/OGA-glyco- 
peptide complexes. Similarly, a bond was observed (2.9 A, 
53% occupancy) in the TAB I'-glycopeptide complex between 
the mainchain carbonyl oxygen atom in Y(-l) and the side- 
chain hydroxyl in S(+l) of the glycopeptide. Interestingly, this 
interaction is not seen in the crystal structure of the TABl -gly- 
copeptide in complex with CpOGA (average distance 6.7 A) 
(Schimpl et al. 2012); most likely due to the glycopeptide ter- 
minating at Ser at the +1 position, as opposed to being part of a 
longer peptide (10 residues) in our model. In the Histone H3' 
glycopeptide, a weak hydrogen bond was seen (3.0 A, 20% oc- 
cupancy) between the -1 position (K, mainchain N) and the -4 
position (T, mainchain carbonyl). Similarly, in the c-Myc' gly- 
copeptide, a weak hydrogen bond was predicted to exist (3.1 A, 
26% occupancy) between the backbone atoms at the -2 pos- 
ition (L, mainchain carbonyl) and the +3 position (L, main- 
chain N). In contrast to the cases of p53' and TABl', in which 
the hydrogen bond spans only the glycosylated residue, in 
Histone H3' and c-Myc', the hydrogen bond spans three resi- 
dues, presumably weakening the interaction in the latter cases. 
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With regard to intramolecular glycan-peptide interactions, it is 
notable that in each case a hydrogen bond was formed between 
the hydroxymethyl group (GlcNAc-06) and the backbone of 
the glycosylated serine (Ser-N). This hydrogen bond (3.4 A) 
was also seen in the p53' glycopeptide-Q?OGA complex 



(Schimpl et al. 2012). GlcNAcylation via threonine, as in 
c-Myc', appears to abolish this interaction. 

An analysis of the RMSD of the peptide backbones was per- 
formed to quantify the conformational distribution for each 
peptide (Figure 3A), relative to the initial structure in the MD 



cc 




0 2000 4000 6000 8000 10000 12000 14000 16000 

Frame (20ps/frame) 




0 2000 4000 6000 8000 10000 12000 14000 16000 

Frame (20ps/frame) 



Fig. 3. Backbone atom (CA, C, O, N) RMSD values for each peptide throughout the 350ns MD as a function of time. (A) Average I^MSD of each peptide relative to 
its starting confonnation. (B) Each peptide relative to the backbone conformation of the p53 glycopeptide conformation from the crystal structure (PDBid 2YDR). 
The RMSD values are coloured as follows; c-Myc' (purple), Histone H3' (red), p53' (blue), poly-Ala' (green), TABl' (cyan). 
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simulations. This analysis indicated that Poly-Ala' was the most 
flexible glycopeptide, with the average RMSD values being: 1.0 
A (poly-Ala'), 0.8 A (c-Myc'), 0.6 A (Histone H3'), 0.6 A (p53') 
and 0.5 A (TABl'). The RMSD analysis (Figure 3B), relative to 
the confomiation of the p53-glycopeptide in complex with 
Q?OGA (Schimpl et al. 2012), indicated that the backbone con- 
formations of each of the glycopeptides are similar: 2.6 A 
(c-Myc'), 1.9 A (Histone H3'), 2.0A (p53'), 1.5 A (poly-Ala') 
and 1.8 A (TABT). Additionally, to identity similar conform- 
ational features sampled during the simulation of each glycopep- 
tide, a clustering analysis was performed on the five MD data 
sets. The cluster analysis revealed that the top three clusters from 
each peptide cover over 75% of the total conformational space 
sampled. Of all of the glycopeptides, poly-Ala' and c-Myc' dis- 
played the highest degree of structural diversity significantly 
populating at least two clusters (see Table III), whereas all of the 
other glycopeptides could be characterized by a single dominant 
cluster. The RMSD of each peptide, from the most populated 
cluster, relative to the conformation of the p53-glycopeptide in 
complex with CpOGA (Schimpl et al. 2012), was comparable 
with those seen in the average RMSD analyses, and also showed 
that the peptide backbone conformations are aU within 1 .7 A of 
each other (Table III). 

Structure and dynamics of the free glycopeptides 
To gain an understanding of the extent to which the free glyco- 
peptides in solution exist in conformations appropriate for 
binding to the BtOGA, independent MD simulations (500 ns) 
were performed on each of the five glycopeptides. An analysis 
of the intramolecular hydrogen bonds in each system indicated 



Table III. Clustering analysis'" for eacii of tlie five glycopeptides 



Cluster 


Cluster population 


RMSD 


c-Myc 
1 


56 


2.9 


2 


30 


2.3 


3 


6 


3.0 


Total 


91 




Histone H3 






1 


79 


1.9 


2 


9 


1.7 


3 


5 


2.0 


Total 


92 




p53' 
1 


92 


1.9 


2 


5 


2.2 


3 


2 


2.2 


Total 


99 




Poly-Ala' 
1 


36 


1.4 


2 


27 


1.4 


3 


12 


1.3 


Total 


75 




TABl' 






1 


91 


1.8 


2 


5 


1.5 


3 


2 


1.5 


Total 


98 





"Relative to the backbone conformation of the p53 glycopeptide in complex 
with CpOGA (Schimpl et al. 2012) and the structure with the median RMSD in 
the cluster Populations in percent, distances in Angstroms. 



that p53' and TABl' displayed transient interactions for ~29% 
of the simulations (see Supplementary data. Table SIV). The 
other glycopeptides appeared to be even less-ordered. The 
interactions within p53' were similar to those seen in the bound 
peptide, whereas in TABl' they were unique to the free peptide. 
Thus, only in the case of p53' does the free peptide appear to 
populate to some extent conformations comparable with those 
seen in the bound state. It is notable also that intramolecular 
interactions between the glycan and residues at the +1 or -1 
positions were observed in the free glycopeptides, similar to 
those seen in the bound forms, but that even in the case of p53' 
these interactions were transient. 

To further quantify the conformational similarities between 
the bound and free glycopeptides, the bound conformation of 
each peptide, as extracted from the dominant cluster, was com- 
pared with the solution conformations. The resulting RMSD plots 
for the free glycopeptides, relative to the bound conformaiton, in- 
dicate that p53' exhibits relatively low RMSD values (of ~2^ 
A), further supporting the idea that the p53' glycopeptide exists in 
a conformation similar to that of the bound (Supplementry data. 
Figure S3). In contrast, in the case of TABl', the intemal hydro- 
gen bonds for the free glycopeptide are associated primarily with 
a conformation that is significantly different from that seen in the 
bound state (RMSD ~4-5 A). Neither histone H3' nor poly- Ala' 
appear to adopt conformations similar to the bound forms, 
however, c-Myc', despite the absence of stable intemal hydrogen 
bonds, also adopts conformations in solution that are similar to 
the bound form (RMSD ~2-5 A). 

Binding free energy analysis 

Per-residue binding iiee energy calculations (molecular mechanics- 
generalized bom surface area (MM-GBSA)) were carried out on 
GlcNAc-OMe in complex with BtOGA, as well as the five glyco- 
peptide-5/OGA complexes. This allowed us to separate and quan- 
tify the contributions made by the GlcNAc and the peptide 
portions of the glycopeptides, as well as to identify which BtOGA 
residues make a significant contribution to glycopeptide affinity. 

BtOGA per-residue binding energy contributions. In agreement 
with the hydrogen bond analysis of the OGA-glycopeptide 
interactions, the MM-GBSA data showed that the five BtOGA 
residues (D344, N339, D242, K166 and H433), which make 
sidechain interactions with the glycan, rank among the highest 
contributors to binding affinity (Figure 4 and Supplementary 
data. Table SV). Additionally, the MM-GBSA analysis for the 
OGA-GlcNAc-OMe complex found that protein residues (W337, 
Y282, N372, Y137, V314 and V342) contribute toward binding 
GlcNAc through a combination of van der Waals and hydrophobic 
interactions. Other residues that were predominantly associated 
with the peptide portion of the glycopeptides were also 
identified; for example D243 (see Figure 4 and Supplementary 
data. Table SV). Notably, residue Y137 showed a markedly greater 
contribution to binding for all glycopeptides, compared with that 
seen for its interaction with GlcNAc-OMe alone. Further, the 
contribution from Y137 varied among the glycopeptides, 
making a significantly stronger interaction with p53' (-8.6 kcal/ 
mol), which may be compared with values for histone H3' 
(-5.5 kcal/mol), c-Myc' (-5.0 kcal/mol), TABl' (-4.5 kcal/mol), 
poly-Ala' (-3.9 kcal/mol) and GlcNAc-OMe (-1.2 kcal/mol). 
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^ 

Fig. 4. The middle stracture from each cluster analysis, of c-Myc' (purple), Histone H3' (red), p53' (blue), poly- Ala' (green) and TABl' (cyan) are shown in complex 
with OGA. The per-residue binding interactions for the OGA residues are represented as a colour scale; from lowest (most attractive) (red) to highest (least attractive) 
(grey) binding interaction energy value. The OGA residues that are most significant to peptide binding are shown for each complex. 



This enhancement stems from hydrogen bonding with the 
glycosylation site Ser or Thr residues, common to each 
glycopeptide, as well as through van der Waals interactions 
with glycopeptide residues at the -2 and -3 positions. 
Similarly, residue W286 contributes to peptide binding in all 
cases, principally through van der Waals interactions with 
residues at the +2 to +4 positions (TABl' (-S.Okcal/ 
mol) > poly- Ala' (-3.7 kcal/mol)>p53' (-3.1 kcal/mol) histone 
H3' (-2.0 kcaVmol) > c-Myc' (-1.3 kca^mol) > GlcNAc-OMe 
(-0.2 kcal/mol)). Residue H433 also contributes to binding in 
all of the peptides; interestingly the interactions with H433 
ranks the glycopeptides in an inverse order relative to that seen 
for W286, namely c-Myc' (-2.6 kcal/mol) > histone H3' (-2.5 
kcal'mol)>p53' (-1.7 kcal/mol) >T4B1' (-1.3 kcal/mol) > poly- 
Ala' (-1.0 kcal/mol) >GlcNAc-OMe (-0.7 kcal/mol). This suggests 
that W286 and H433 counter balance each other in terms of 
peptide specificity. 

Certain S^OGA residues were also found to be of particular 
importance to a subset of glycopeptides (Figure 4). For example, 
residues Y550 (-2.9 kcal/mol), R347 (-2.3 kcal/mol) and S245 
(—2.3 kcal/mol) each contribute significantly to the affinity 



of histone H3'. Residue Y550 interacts favorably with histone 
H3' residue R(-2), primarily through van der Waals interac- 
tions. As might be expected, the interaction between R347 and 
R(-2) is electrostatically repulsive; however, it is predicted to 
be net favorable, as a result of desolvation effects. This suggests 
that mutation of R347, to either D or E, or to an aromatic residue, 
may lead to increased binding specificity for histone H3'. As in 
the case of histone H3', residue Y550 also contributes to the affin- 
ity of p53' (-1.7 kcaFmol), primarily through van der Waals 
interactions with W(-3). Finally, residues S245 (-2.4 kcal/ 
mol) > 1244 (-2.7 kcal/mol) >Q2 10 (-1.8 kcal/mol) are each 
involved in favorable interactions with c-Myc' at E(-4). 



Glycopeptide per-residue binding energy contributions. In 
order to define the contributions from each residue in the five 
glycopeptides, a per-residue decomposition of the MM-GBSA 
data was perfonned (see Table IV for total energy interactions and 
Supplementary data. Table SVI for the per-residue breakdown). 
Specifically, over 50% of the total binding free energy for each of 
the five glycopeptides comes directly from interactions involving 
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Table IV. Total component binding free energies" for each of the five 
glycopeptides. 



iviuiciy 


Van Qer 
waals 


ji!/lectrostQtic 


Polar 

Desolvation 


1 

Desolvation 


Total 

Interaction 


C-Myc' 












GlcNAc 


-13.0 


-45.9 


34.3 


-2.4 


-27.0 (2.2) 


Peptide 


—21.8 


37.9 


—32.5 


-2.9 


—19.1 (6.0) 


Histone H3' 












GlcNAc 


-12.9 


-46.8 


34.2 


-2.3 


-27.9 (1.7) 


Peptide 




— M'\ ft 
1 / J.O 


1 SA 7 
1 oU. / 


J. a 


—lA A (1 'y\ 


p53' 












GlcNAc 


-13.1 


-45.6 


33.6 


-2.3 


-27.4(1.7) 


Peptide 


-31.0 


51.2 


-39.8 


-4.6 


-24.2 (7.0) 


Poly-Ala' 












GlcNAc 


-12.3 


-41.5 


33.2 


-2.3 


-22.9 (4.7) 


Peptide 


-23.8 


-3.8 


12.2 


-3.5 


-18.7(7.6) 


TABl' 












GlcNAc 


-13.5 


-40.4 


32.4 


-2.3 


-23.8(1.9) 


Peptide 


-27.0 


-13.6 


20.4 


-4.2 


-24.4 (7.9) 



"In kcal/mol with standard deviation in parentheses 



0-GlcNAc. The significance of the GlcNAc moiety in driving 
glycopeptide binding has similarly been shown through kinetic 
studies of hOGA with various 0-GlcNAcylated substrates (Shen 
et al. 2012). Interestingly, the binding energy Irom the GlcNAc 
falls into two distinct categories. For the glycopeptides; c-Myc', 
Histone H3' and p53', the GlcNAc contributed essentially 
equivalent interaction energies (-27.0 to -27.9 kcal/mol), whereas 
for the poly-Ala' and TABl' glycopeptides, the glycan contributed 
—22.9 and -23.8 kcal/mol, respectively (Table IV). The loss in 
aiiinity in the latter two cases is a direct result of weakened 
electrostatic interactions (decreased by 4—5 kcal/mol) with the 
glycan. The analysis of the hydrogen bond interactions for these 
systems suggests that this is a result of weaker interactions 
specifically with residues D344 and N339, as indicated by both 
slightly lower occupancy and larger distances of the hydrogen 
bonds (Supplementary data, Table SII). Closer inspection indicates 
that the glycan, in the poly-Ala' and TABl' glycopeptide 
complexes, is more mobile than in the other three complexes; 
the average positional RMSD for the nonhydrogen atoms in 
GlcNAc for the poly- Ala' (0.82 A) and TABl' (0.61 A) 
complexes are shghtly higher than for the GlcNAc in the other 
three complexes (c-Myc': 0.53 A, histone H3': 0.41 A, and p53': 
0.51 A). 

The peptides (not including the GlcNAc) contribute (-18.7 
to -24.4 kcal/mol) to the total interaction energy; or up to ~50% 
of the total binding affinity. As might be expected on the basis of 
relative flexibilities, the interaction energies associated with the 
GlcNAc residues are much less variable than those of the pep- 
tides. These data suggest that peptide residues in the vicinity of 
the glycosylation site (±5 residues) can enhance glycopeptide af- 
finity, and also that loss of the glycan from the peptide, by enzym- 
atic cleavage, would markedly reduce the affinity of the products. 
Notably, as in the case of the GlcNAc residues, the glycosylation 
site contributed similar values in each complex (an average of 
-2.9 ±0.7 kcal/mol), with no significant differences being 
observed for Ser or Thr (c-Myc') residues. In the poly-Ala' refer- 
ence system, the residues in the +1 to +5 positions, relative to the 
glycosylation site, contribute ~-10.0 kcal/mol to the binding 
interaction. In the cases of the other four glycopeptides, residues 



in the +1 to +5 positions make weaker interactions with BtOGA 
(histone H3': -8.0 kcal/mol, p53'; -7.8 kcal/mol, c-Myc': -4.9 
kcal/mol), relative to poly- Ala', with the exception of TABl' 
(TABl': -11.1 kcal/mol). In contrast, relative to poly- Ala' (-6.2 
kcal/mol), residues in the -1 to -4 positions consistently contrib- 
ute significantly to the interaction energies (p53': -16.0 kcal/mol, 
histone H3': -13.0 kcal/mol, c-Myc': -11.5 kcal/mol, TABl': 
-10.3 kcal/mol). 



Discussion 

Here we have extended the simulations to systems for which ex- 
perimental data do not exist. The new structural information 
provides a fundamental understanding of the manner in which 
OGA tolerates variations in peptide substrate sequence. It is 
also important to note that interaction energies are highly sensi- 
tive to the 3D structure of the complex, and for these reasons 
we have placed considerable emphasis on validating and inter- 
preting the structural properties of the fi/OGA-glycopeptide 
complexes. 

The computational analyses indicate that each of the five gly- 
copeptides can adopt comparable conformations when bound 
to the enzyme and that the interaction energies of each complex 
were remarkabley similar, spanning a range of <5 kcal/mol. 
The bound complexes are in good agreement with recently 
reported data for the complex between the bacterial OGA 
homolog from C. perfringens (QjQGA) in complex with an 
O-GlcNAcylated p53 peptide (PDBid 2YDR) (Schimpl et al. 
2012). The fact that poly-Ala' behaved similarly to the other 
four natural glycopeptides suggests that the interactions with 
BtOGA are structurally nonspecific. The occupancy analysis of 
the hydrogen bond network revealed that none of the peptides 
formed stable inter- or intramolecular hydrogen bonds. This is 
not unexpected, given the highly dynamic nature of the pep- 
tides. Nevertheless, the analyses suggested that the "V"-shaped 
conformation of each of the glycopeptides was stabilized to 
greater or lesser extents by weak intramolecular interactions 
spanning the glycosylation site, as well as by a limited number 
of intermolecular interactions. 

Although stacking between Y137 and the peptide backbone 
was noted by Shimpl et al., no such stacking was observed 
during the simulations of the 5/OGA glycopeptide complexes 
(Supplementary data. Table SVII). Rather, the interaction energy 
associated with Y137 stems from less-ordered vdW contacts with 
the glycopeptides. The lack of stacking during the MD simula- 
tions indicates that there is no hydrophobic or van der Waals 
driving force for such an orientation. The lack of a stable stacked 
geometry might also arise from the inability of the classical force 
field to capture effects arising from tz ioK orbital overlap. 

A per-residue analysis of the energetic contributions of BtOGA 
to binding, fi-om both hydrogen bonding and nonpolar interac- 
tions, indicated that a small subset of residues (D344, N339, 
D242, K166, H433, W337, Y282, N372, Y137, V314 and V342) 
is responsible for binding the glycan. Similarly, the analysis indi- 
cated that four 5/OGA residues (Y137>D243 W286>H433) 
were predominantly responsible for binding tlie peptide portion 
for all of the glycopeptides. Our results are in good agreement 
with the experimentally observed loss of enzymatic activity 
observed for BtOGA mutants (D242A/N, D243A/N, N372A, 
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Y137F and Y282F) (Dennis et al. 2006), as well as CpOGA 
mutant D401A {BtOGA D344) (Rao et al. 2006). It has also been 
reported that mutation of the conserved Y69 in hOGA (5/OGA 
Y 137) to any residue other than F led to a loss in enzymatic activ- 
ity (Schimpl et al. 2012). Given that the remaining peptide resi- 
dues contribute to afiSnity through both backbone and sidechain 
interactions, it is unclear to what extent mutations at these posi- 
tions would affect affinity or modulate enzyme Ky„ values. Tlie 
per-residue analysis also identified residues which contributed spe- 
cffically to aflinity for each of the glycopeptides. Such data enable 
the prediction of sites whose mutation may lead to significant pre- 
ferences for one substrate over another For example, mutation of 
_SOGA R347 to either a negatively charged, or aromatic, residue 
could potentially lead to an increased affinity and specificity for 
histone H3'. 

Taken together, the MM-GBSA data indicate that the affinity 
for substrate varies as a fiinction of peptide sequence, despite 
the fact that there are no strong recognition motifs associated with 
any of the peptides. Tliis observation would be expected to lead to 
variations in the enzyme values, as observed experimentally 
(Schimpl et al. 2012). Consistent with interpretations of the crystal 
structures (Schimpl et al. 2012), affinity contributions are made by 
peptide residues near the GlcNAc site, but here we also see signifi- 
cant contributions from residues as distal as the -4 position. 
Although the glycopeptides all adopted grossly similar "V"-shaped 
conformations, none displayed persistent internal hydrogen bonds. 
Thus it is uncertain to what extent intemal structuring of the 
peptide contributes to the affinity of the interactions, or to the of 
the enzyme, as has been proposed (Schimpl et al. 2012). 

Conclusion 

Here we have presented an atomic-level analysis of the structure 
and dynamics of the binding interaction between a bacterial 
OGA homolog in complex with five GlcNAcylated peptides of 
biological interest. The primary motivation for the research was 
to develop an understanding of the unique lack of substrate se- 
quence specificity displayed by OGA. The MD simulations 
show that despite the differences in sequence, all of the glyco- 
peptides adopted a similar conformation when bound to OGA, 
consistent with limited experimental data (Schimpl et al. 2012), 
but nevertheless retain considerable intemal flexibility. This 
flexibility is also thought to occur in the region spanning the 
sites of known 0-GlcNAcylated proteins, and as such there 
are currently no protein structures available, with the excep- 
tion of p53 (PDbid ITUP), where these regions are fully 
defined (Schimpl et al. 2012). This observation has implica- 
tions not only for the action of the hydrolase OGA, but also 
potentially for the specificity of its cognate partner, the trans- 
ferase OGT 

The theoretical approach taken here enabled us to quantify the 
relative importance of specific and nonspecific interactions. The 
data show that the specificity of OGA for the GlcNAcylated pep- 
tides arises primarily from strong conserved interactions with the 
glycan; nevertheless, nonspecific interactions with the peptidic 
component of the substrates contribute signfficantly to affinity. 
This observation provides a clear explanation for the ability of 
this enzyme to tolerate a range of polymorphisms in the vicinity 
of the glycosylation site in the substrate. The analysis also indi- 
cates that variations in observed AT^ values (Schimpl et al. 2012) 



are a natural consequence of affinity contributions from non- 
specific interactions with the substrate. The potential exists there- 
fore to employ the present analysis to guide the development 
glycopeptide-specffic inliibitors, or conversely, the conversion of 
OGA into a reagent that could target specific 0-GlcNAcylated 
peptide sequences. 



Computational methods 

MD simulations 

All MD simulations were carried out using version 4.04 of the 
Groningen machine for chemical simulations (GROMACS) 
(Hess et al. 2008) or assisted model building with energy refine- 
ment (AMBER)12 (Case et al. 2005) software packages. All 
calculations employed the AMBER99SB (Comell et al. 1995) 
parameters for the protein augmented by the glycans and glyco- 
proteins in AMBER (GLYCAM)06g (Kirschner et al. 2008) 
parameters for the carbohydrate. The total charge was neutrahzed 
by addition of expHcit Na^ or CI counter ions as required. Each 
system was solvated with TIP3P water (Jorgensen et al. 1983) in 
a cubic simulation box (120 A per side). The positions of all 
water molecules, counterions and all hydrogen atoms were opti- 
mized through 1 0,000 steps of steepest descent energy minimiza- 
tion. During energy minimization and MD equilibration the 
nonhydrogen atoms were restrained with a force constant of 
lOOOkJ/mol/nm^ 

A step-wise protocol was employed for MD equilibration be- 
girming with heating in the NVT ensemble to 300 K over 100 ps. 
Subsequently, water, hydrogen atoms and counter ions were equi- 
librated for a fiarther 200 ps in the NPT ensemble at 300 K at a 
pressure of 1 bar Restraints were then removed from the side- 
chains of the protein-ligand complexes and the system equili- 
brated for a fiirther 1 ns. During the final equilibration phase (2 
ns), all atoms were released from positional restraints. In all simu- 
lations, the linear constraint solver (LINCS) was used to constrain 
all hydrogen-containing molecules (Hess et al. 1997), permitting 
an integration step of 2 fs. The temperature was held constant at 
300 K by a Langevin themiostat with a coupling time constant of 
0.1 ps. Elecfrostatic interactions were treated with the Particle- 
Mesh Ewald (PME) algorithm (Darden et al. 1993). Cut-off 
values for Coulomb interactions were set to 12 A, while van der 
Waals interactions were switched off between 10 and 1 1 A. 

Protein coordinates were obtained from the crystal structure of 
the bacterial OGA fix)m B. thetaiotaomicron {BtOGA), in complex 
with the GlcNAc transition state mimic inhibitor PUGNAc 
(PDBid: 2WS) (Macauley et al. 2008). The N-terminal domain 
between L5 and P125, located between 28 A and 54 A from the 
binding site, was removed to reduce the computational demand. 
The N- and C-terminal residues (Serl26 and Ala586) were capped 
with acetyl (ACE) and A^methyl (NME) groups, respectively. 
Aside fix)m residue D243 (D 1 75 in hOGA), which is known to be 
protonated ((^etinbaj et al. 2006), the protonation state of all other 
ionizable residues was chosen to reflect their standard proton- 
ation at pH 7. All crystallographic water molecules were 
removed and missing hydrogen atoms were added to the struc- 
ture using pdb2gmx (Feenstra et al. 1999). Where possible, 
protonation of His residues at the - or 8-nitrogen atoms was 
based on optimal hydrogen bonding interactions; otherwise a 
default of His-8 was assumed. 
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Generation of the glycopeptide-BtOGA complexes 
The conformation of the bound native GlcNAc hgand was un- 
ambiguously detemiined based on the PUGNAc coordinates. It 
has previously been shown that the common a«//-conformation 
of the NAc group in PUGNAc does not permit it to fit into the 
binding site of the BtOGA (Rao et al. 2006), thus the NAc group 
of the GlcNAc residue was rotated iiom anti to syn, comparable to 
the orientation seen for the oxime in the fi^OGA-PUGNAc 
complex (Figure 5 A). Syn and anti refer to values of 0° and 180° 
with respect to the H2-C2-C-N torsion angle; however, as hydro- 
gen atoms are not routinely reported in crystal structures, these 
orientations have altematively been defined here in terms of 
C1-C2-N-C for which = -60° andanti^ 120°. A 3D structure 
for GlcNAc-P-O-Ser was built using the GLYCAM webtool 
(Woods 2005-2013) and modified using Maestro (Maestro 2011) 
to generate three rotamers of the serine side chain (ji = 40.7, 
170.6, and -49.1°, see Figure 5B), as identified from the 
Dynameomics database (Day et al. 2003; Beck et al. 2008; Kehl 
et al. 2008; Simms et al. 2008). The N- and C-terminal residue was 
capped for all glycopeptides with an acetyl (ACE) or A'-methyl 
(NME) group, respectively. To determine the preferred conform- 
ation of the GlcNAc-P-O-Ser linkage, tliree initial MD simulations 
of this glycopeptide in complex with BtOGA were carried out. 
Each simulation was initiated with the angle of the Ser sidechain 
in one of the three possible stable rotamers. 

Following MD simulations, the most populated GlcNAc-P- 
0-Ser rotamer was identified {<^/^/zi ~-61.3/-179.5/-77.9°; see 
Figure 5C and Supplementary data. Figure 1), and used to define 
an initial orientation of the GlcNAc moiety relative to the peptide 
backbone for use in docking each of the GlcNAcylated peptides 
(c-Myc', histone H3', p53', TABl' and a poly-Ala') to BtOGA. 

Molecular docking was performed with Autodock Vina (AD 
Vina) (Trott and Olson 2010), employing the iterated local 
search global optimizer algorithm (Baxter 1981; Abagyan et al. 
1994). Structures for the five fiilly extended glycopeptides were 
generated using Maestro (Maestro 2011), and docked into the 
BtOGA structure, while maintaining the GlcNAc-P-O-Ser 
linkage at the preferred rotamer values, as defined above. All 
other backbone and sidechain torsion angles in the glycopep- 
tides were allowed to be flexible. The graphical user interface 
program AutoDock Tools (Sanner 1999) was used to prepare 
the structures for all docking simulations. To prepare the 
5^0GA crystal structure for docking, crystallographic water 
molecules, as well as the co-crystallized inhibitor, PUGNAc, 



were removed. Polar hydrogen atoms were added where neces- 
sary with AutoDock Tools. The docking region was selected by 
centering a grid box (35 A per side) on the Ca atom of the nu- 
cleophile residue D243. The optimal glycopeptide conform- 
ation from each of the five dockings was determined by 
searching the ranked docking poses for the lowest energy struc- 
ture; in which the GlcNAc was placed in a pose comparable 
with that seen in the MD simulations of the GlcNAc-P-O-Ser 
complex (Figure IB). The five fi/OGA-glycopeptide com- 
plexes were employed for subsequent MD simulations. 
Simulations of the free glycopeptides were initiated from the 
most populated conformation of the bound glycopeptides. 



Analysis 

The conformational properties of each of the five peptides was 
analyzed in terms of RMSD of the peptide backbone atoms 
(Ca, N, C, O) employing visual molecular dynamics (VMD) 
(Humphrey et al. 1996). The RMSD values for each peptide was 
calculated relative to the first time step of the MD simulation, 
and also with reference to the experimental coordinates reported 
recently for the p53 glycopeptide bound to CpOGA. The 
Cremer-Pople-puckering parameters (Cremer and Pople 1975) 
of the GlcNAc ring were monitored throughout each MD using 
the GROMACS (Hess et al. 2008) tool g_puckering. Clustering 
analyses of the peptide confonnations were performed using the 
GROMACS tool g_c luster, employing the Gromos Algorithm 
(Daura et al. 1999) using an RMSD cut-off of 1.0 A. Stacking 
interactions were monitored using the GROMACS (Hess et al. 
2008) tool g_sgangle to measure the angle and distance between 
two specified groups. An analysis of the hydrogen bond network 
was carried out for each complex using VMD (Humphrey et al. 
1996) with a distance cut-off of 3.5 A and a minimal occupancy 
of 5%. Binding free energies were calculated with the single- 
trajectory MM-GBSA method (KoUman et al. 2000, Srinivasan 
et al. 1998), with average values computed from an ensemble of 
100 uncorrelated snapshots collected every 3.5 ns from the 350 
ns trajectory. Before the analyses, all water molecules were 
removed from each complex, and the solvation energy approxi- 
mated through the generalized Bom (GB) implicit solvation 
model (igb = 2) (Onufriev et al. 2004). This approach has been 
applied by us (Kadirvelraj et al. 2006, Kadirvelraj et al. 2011) 
and others (Bryce et al. 2001) for the analyses of carbohydrate- 
protein afiinity. 




Fig. 5. (A) P-OMe-GlcNAc(z), shown as sticks with pink carbons, was superimposed onto PUGNAc, shown as grey sticks, in the OGA (PDBid 2WS) binding site, 
shown as grey cartoon. (B) The tliree p-Ser-O-GlcNAc(z) g+ (green), t (cyan), g- (pink) rotamers, prior to MD. C) P-Ser-O-GIcNAc(z) rotamers after 10 ns 
simulation. 
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Supplementary data for this article is available online at http:// 
gly cob . oxfordj oumals .org/. 
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0-GlcNAc hydrolase; OGT, 0-GlcNAc transferase; PDB, Protein 
Data Bank; PME, Particle-Mesh Ewald; GROMACS, Groningen 
machine for chemical simulations; VMD, visual molecular 
dynamics; OMe, 0-methyl. 

References 

Abagyan R, Totrov M, Kuznetsov D. 1994. ICM — A new method for protein 
modeling and design: Applications to docking and structure prediction from 
the distorted native conformation. J Coni/)!;^ Chem. 15:488-506. 

Baxter J. 1981. Local optima avoidance in depot location. J Oper Res Soc. 
32:815-819. 

Beck DA, Jonsson AL, Schaeffer RD, Scott KA, Day R, Toofanny RD, Alonso 

DO, Daggett V. 2008. Dynameomics: mass annotation of protein dynamics 

and unfolding in water by high-throughput atomistic molecular dynamics 

simulations. Protein EngDes Set. 21:353-368. 
Bryce RA, Hillier IH, Naismith JH. 2001. Carbohydrate-protein recognition: 

molecular dynamics simulations and free energy analysis of oligosaccharide 

binding to concanavalin A. Biophys J. 81:1373-1388. 
Case DA, Cheatham 111 TE, Darden TA, Gohlke H, Luo R, Merz KM, Onufriev 

A, Simmerling CL, Wang B, Woods RJ. 2005. The AMBER biomolecular 

simulation pcogmms,. J Comput Chem. 26:1668-1688. 
Cetinba? N, Macauley MS, Stubbs KA, Drapala R, Vocadlo DJ. 2006. 

Identification of Asp 1 74 and Asp 1 75 as the key catalytic residues of human 

0-GlcNAcase by functional analysis of site-directed mutants. Biochemistry 

(Mosc). 45:3835-3844. 
Cheng X, Cole RN, Zaia J, Hart GW. 2000. Alternative 0-glycosylation/ 

0-phosphorylation of the murine estrogen receptor p. Biochemistry (Mosc). 

39:11609-11620. 



Cho Y, Gorina S, Jeffrey PD, Pavletich NR 1994. Crystal structure of a p53 

tumor suppressor-DNA complex: understanding tumorigenic mutations. 

Science. 265:346-355. 
Chou T-Y, Hart GW, Dang CV. 1995. c-Myc is glycosylated at threonine 58, a 

known phosphorylation site and a mutational hot spot in lymphomas. J Bio! 

C/iem. 270:18961-18965. 
Comer Fl, Hart GW. 2000. O-Glycosylation of nuclear and cytosolic proteins: 

Dynamic interplay between 0-GlcNAc and 0-phosphate. J Biol Chem. 

275:29179-29182. 

Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Ferguson DM, 
Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. 1995. A second gener- 
ation force field for the simulation of proteins, nucleic acids, and organic 
molecules. /^m Chem Soc. 117:5179-5197. 

Cremer D, Pople JA. 1975. General definition of ring puckering coordinates. 
J Am Chem Soc. 97:1354-1358. 

Darden T, York D, Pedersen L. 1993. Particle mesh Ewald: An N [center-dot] 
log(N) method for Ewald sums in large systems. / Chem Phys. 
98:10089-10092. 

Daura X, Gademann K, Jaun B, Seebach D, van Gunsteren WF, Mark AE. 
1999. Peptide folding: When simulation meets experiment. Angew Chem bit 
Ed. 38:236-240. 

Day R, Beck DA, Arnien RS, Daggett V 2003. A consensus view of fold space: 
combining SCOP, CATH, and the Dali Domain Dictionary. Protein Sci. 
12:2150-2160. 

Dennis RJ, Taylor EJ, Macauley MS, Sfi-ibbs KA, Turkenburg JP, Hart SJ, 
Black GN, Vocadlo DJ, Davies GJ. 2006. Structure and mechanism of a 
bacterial [beta]-glucosaminidase having 0-GlcNAcase activity. Nat Struct 
MolBiol. 13:365-371. 

Feenstra KA, Hess B, Berendsen HJC. 1999. hnproving efficiency of lai'ge time- 
scale molecular dynamics simulations of hydrogen-rich systems. J Comput 
Chem. 20:786-798. 

Hanover JA. 2001. Glycan-dependent signaling: O-Unked N-acetylglucosamine. 

FASEBJ. 15:1865-1876. 
Hanover JA, Krause MW, Love DC. 2010. The hexosamine signaling pathway: 

0-GlcNAc cycling in feast or famine. Biochim Biophys. 1 800:80-95. 
Hart GW, Housley MP, Slawson C. 2007. Cycling of 0-linked [beta]- 

N-acetylglucosamine on nucleocytoplasmic proteins. Nature. 446:1017-1022. 
He Y, Martinez-Fleites C, Bubb A, Gloster TM, Davies GJ. 2009. Strachiral 

insight into the mechanism of streptozotocin inhibition of O-GlcNAcase. 

CarhohydrRes. 344:627-631. 
Henrissat B, Davies G. 1997. Structural and sequence-based classification of 

glycoside hydrolases. Cwr Opin Struct Biol. I'.b'il-b^A. 
Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. 1997. LINCS: A linear 

constraint solver for molecular simulations. / Comput Chem. 

18:1463-1472. 

Hess B, Kutzner C, van der Spoel D, Lindahl E. 2008. GROMACS 4: 
Algorithms for highly efficient, load-balanced, and scalable molecular simu- 
lation. J C/iem Theory Comput. 4:435^47. 

Humphrey W, Dalke A, Schulten K. 1996. VMD: Visual molecular dynamics. 
JMol Graph. 14:33-38. 

Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. 1983. 
Comparison of simple potential fimctions for simulating hquid water J Chem 
Phys. 79:926-935. 

Kadireelraj R, Gonzalez-Outerifio J, Foley BL, Beckliam ML, Jennings HJ, 
Foote S, Ford MG, Woods RJ. 2006. Understanding the bacterial polysac- 
charide antigenicity of Streptococcus agalactiae versus Streptococcus 
pneumoniae. PNAS. 103:8149-8154. 

Kadireelraj R, Grant OC, Goldstein IJ, Winter HC, Tateno H, Fadda E, Woods 
RJ. 2011. Structure and binding analysis of Polyporus squamosus lectin in 
complex with the Neu5Aca2-6Galpl-4GlcNAc human-type influenza 
receptor. Glycohiology. 21:973-984. 

Kamemura K, Hayes BK, Comer Fl, Hart GW. 2002. Dynamic interplay 
between 0-glycosylation andO-phosphorylation of nucleocytoplasmic 
proteins: Alternative glycosylation/phosphorylation of Thr-58, a known mu- 
tational hot spot of c-Myc in lymphomas, is regulated by mitogens. J Biol 
C//em. 277:19229-19235. 

Kehl C, Simms AM, Toofanny RD, Daggett V. 2008. Dynameomics: a multi- 
dimensional analysis-optimized database for dynamic protein data. Protein 
EngDesSel. 21:379-386. 

Kim EJ, Pen'eira M, Thomas CJ, Hanover JA. 2006. An O-GlcNAcase-specific 
inhibitor and subsfrate engineered by the extension of the N-acetyl moiety. 
J Am Chem Soc. 128:4234^235. 



95 



JC Martin et al. 



Kirschner KN, Yongye AB, Tschampel SM, Gonzalez-Outeirino J, Daniels CR, 
Foley BL, Woods RJ. 2008. GLYCAM06: A generalizable biomolecular 
force field. Carbohydrates. J Comput Chem. 29:622-655. 

Kollman PA, Massova I, Reyes C, Kuhn B, Huo S, Chong L, Lee M, Lee T, 
Duan Y, Wang W, et al. 2000. Calculating structures and free energies of 
complex molecules: Combining molecular mechanics and continuum 
models. Acc Chem Res. 33:889-897. 

Macauley MS, Bubb AK, Martinez-Fleites C, Davies GJ, Vocadlo DJ. 2008. 
Elevation of global 0-GlcNAc levels in 3T3-L1 adipocytes by selective in- 
hibition of 0-GlcNAcase does not induce insulin resistance. J Biol Chem. 
283:34687-34695. 

Macauley MS, Vocadlo DJ. 2010. Increasing 0-GlcNAc levels: An overview of 
small-molecule inhibitors of 0-GlcNAcase. Biochim Biophys Acta. 
1800:107-121. 

Macauley MS, Whitworth GE, Debowski AW, Chin D, Vocadlo DJ. 2005. 
0-GlcNAcase uses substrate-assisted catalysis. J Biol Chem. 280:25313- 
25322. 

Maestro. 201 1. Suite 2011: Maestro, version 9.2, Schrodinger, LLC: New York, 
NY, 2011. 

Onufi'iev A, Bashford D, Case DA. 2004. Exploring protein native states and 

large-scale confonnational changes with a modified generalized bom model. 

Proteins. 55:383-394. 
Pathak S, Borodkin VS, Albarbarawi O, Campbell DG, Ibrahim A, van Aalten 

DM. 2012. O-GlcNAcylation of TABl modulates TAKl-mediated cytokine 

release. EMBO J. 3 1 : 1394-1404. 
Perreira M, Kim EJ, Thomas CJ, Hanover JA. 2006. Inhibition of 

0-GlcNAcase by PUGNAc is dependent upon the oxime stereochemistry. 

Bioorg Med Chem. 14:837-846. 
Rao FY, Dorfinueller HC, Villa F, Allwood M, Eggleston IM, van Aalten DMF 

2006. Structural insights into the mechanism and inhibition of eukaryotic 

0-GlcNAc hydrolysis. EMBO J. 25:1569-1578. 
Sanner MF. 1999. Python: A programming language for software integration 

and development. JMol Graph Model. 17:57-61. 



Schimpl M, Borodkin VS, Gray LJ, van Aalten DMF 2012. Synergy of 
peptide and sugar in O-GlcNAcase substrate recognition. Chem Biol. 
19:173-178. 

Shen DL, Gloster TM, Yuzwa SA, Vocadlo DJ. 2012. Insights into O-GlcNAc 
processing and dynanucs through kinetic analysis of 0-GlcNAc transferase and 
0-GlcNAcase activity on protein substrates. J Biol Chem. 287: 15395-15408. 

Simms AM, Toofanny RD, Kehl C, Benson NC, Daggett V 2008. 
Dynameomics: design of a computational lab workflow and scientific data re- 
pository for protein simulations. Protein EngDes Sel. 21:369-377. 

Slawson C, Copeland RJ, Halt GW. 2010. 0-GlcNAc signaling: a metabolic 
link between diabetes and cancer?. Trends Biochem Sci. 35:547-555. 

Srinivasan J, Miller J, Kollman PA, Case DA. 1998. Continuum solvent studies of 
the stability of RNA hairpin loops and helices. JBiomol StirictD)^. 16:671-682. 

Torres CR, Hart GW. 1984. Topography and polypeptide distribution of tennin- 
al N-acetylglucosamine residues on the surfaces of intact lymphocytes. 
Evidence for 0-linked GlcNAc. J Biol Chem. 259:3308-33 17. 

Trott O, Olson AJ. 2010. AutoDock Vina: Improving the speed and accuracy of 
docking with a new scoring function, efficient optimization, and multithread- 
ing. J Comput Chem. 3 1 :455^61 . 

Whitworth GE, Macauley MS, Stubbs KA, Dennis RJ, Taylor EJ, Davies GJ, Greig 
IR, Vocadlo DJ. 2006. Analysis of PUGNAc and NAG-thiazoline as transition 
state analogues for human 0-GlcNAcase: Mechanistic and structural insights 
into inhibitor selectivity and transition state poise. J Am Chem Soc. 129:635-644. 

Woods G. 2005-2013. GLYCAM Web. 

Yang WH, Kim JE, Nam HW, Ju JW, Kim HS, Kim YS, Cho TW. 2006. 
Modification of p53 with O-linked N-acetylglucosamine regulates p53 activ- 
ity and stability. Nat Celt Biol. 8: 1074-1083. 

Zeidan Q, Hart GW. 2010. TTie intersections between 0-GlcNAcylation and phos- 
phorylation: implications for multiple signaling pathways. J Cell Sci. 123:13-22. 

Zhang S, Roche K, Nasheuer H-P, Lowndes NF. 2011. Modification of histones 
by sugar P-N-Acetylglucosamine (GlcNAc) occurs on multiple residues, in- 
cluding histone H3 serine 10, and is cell cycle-regulated. J Biol Chem. 
286:37483-37495. 



96 



